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WO 99/49059 PCT/IL99/001 66 

METHODS OF CANCER DIAGNOSIS USING A CHIMERIC TOXIN 

1. INTRODUCTION 

The present invention relates to methods for cancer 
diagnosis using a chimeric toxin. In particular, the 
invention relates to the use of a chimeric toxin composed of 
gonadotropin releasing hormone (GnRH) and Pseudomonas 

exotoxin A (PE) to detect a tumor- associated epitope 
expressed by human adenocarcinomas. Mutated GnRH- PR 
molecules that bind but do not kill tumor cells are 
20 exemplified. 

2. BACKGROUND OF THE INVENTION 

GnRH is a decapeptide produced by hypothalamic neurons 
and secreted into the hypophysioportal circulation via portal 
vessels. It is first synthesized as a larger precursor 

15 protein which is processed by proteolytic cleavage and 
amidation at its C-terminal glycine. GnRH stimulates 
gonadotroph cells in the anterior pituitary gland to release 
luteinizing hormone and follicle-stimulating hormone, thereby 
regulating the hypothalamic -pituitary gonadal control of 
human reproduction. 

20 The involvement of GnRH has been implicated in certain 

carcinomas, and GnRH analogues have been used in the 
treatment of breast, prostatic, pancreatic, endometrial and 
ovarian cancers (Kadar et al., 1988, Prostate 12:229-307). 

The analogues suppressed tumor cell growth in vitro and in 
^ vivo. In addition, GnRH binding sites have been reported in 
certain solid tumors and in established cell lines (Emons et 
al. # 1993, J. Clin. Endocrinol. Metab. 77:1458-1464), though-, 
preliminary results suggest that the GnRH receptor (GnRHR) 
involved might differ from the previously documented receptor 
(Kadar et al . , 1992, Biochem. Biophs. Res. Comm. 189:289- 
30 295). 

Although GnRH binding 'sites have been demonstrated in 
tumors, such tumors were derived mainly from hormone 
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dependent tissues. Recently, Nechushtan et al. reported that 

certain hormone non- responsive tumors such as colon 
carcinomas, renal cell carcinomas and hepatocellular 
carcinomas were susceptible to killing by a chimeric toxin, 
GnRH-PE (J. Biol. Chem. , 1997, 272:11597). GnRH caused the 
chimeric toxin to bind to GnRHR- expressing tumor cells, 
whereas PE mediated cell killing by inhibiting protein 
synthesis. However, prior to the present invention, it was 
not known whether the observed effects were due to the 
expression of a natural. GnRHR :byr hormone non -responsive 
tumors or a new epitope recognized by GnRH-PE that was 
distinct from that bound by GnRH. 

3. SUMMARY OF THE INVENTION 

The present invention relates to methods for detecting a 
tumor cell using a GnRH-PE chimeric toxin, and GnRH-PE 
chimeric toxins that bind but do not kill tumor cells. In 
particular, it relates to the use of a GnRH-PE chimeric toxin 
to detect an epitope expressed by adenocarcinomas. For the 
practice of the invention, it is preferred that the GnRH-PE 
is modified to reduce its cytotoxic activities without 
altering its binding specificity to tumor cells. Such 
20 molecules are particularly useful for the detection of tumor 
cells in a biological specimen and in a human subject who has 
cancer . 

The invention is based, in part, on Applicants 1 
discovery that two mutated recombinant chimeric toxins 
composed of GnRH and PE, referred to as LGnRH-PE40M and 
25 LGnRH-PE66M, bind to tumor cells without killing them. Since 
these chimeric toxins do not bind granulosa tumor cells which 
express natural GnRHR recognized by GnRH, the chimeric toxins 
of the invention recognize a new tumor-associated epitope 
expressed by adenocarcinomas. 

3Q 4. BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1A Nucleotide . sequence (SEQ ID N0:1) and 

- 2 - 



SUBSTITUTE SHEET (RULE 26) 



WO 99/49059 



PCT/IL99/0D166 



10 



15 



20 



and IB. amino acid sequence (SEQ ID NO: 2) of LGnRH - 
PE66. Amino acid residue #575 identified 
within a square is deleted in a mutated 
chimeric toxin, LGnRH - PE6 6M . 

Figure 2. Nucleotide sequence (SEQ ID NO: 3) and amino 
acid sequence (SEQ ID NO:4) of LGnRH- PE40 . 
Amino acid residue #336 identified within a 
square is deleted in a. mutated chimeric 
toxin, LGnRH- PE40M. 

Figure 3 Mutated GnRH-PE- chimeric toxins, LGnRH - PE4 QM 
and LGnRH- PEG 6M, did not exhibit ADP- 
ribosylation activities. 

Figure 4. Mutated GnRH-PE chimeric toxins, LGnRH -PE4 0M 
and LGnRH- PE66M, did not inhibit protein 
synthesis in 293 renal carcinoma cells, while 
the non-mutated chimeric toxins showed 
cytotoxic activities. Inhibition of protein 
synthesis is used as an indication of 
cytotoxicity. 

Figure 5. GnRH-PE chimeric toxins did not inhibit 
protein synthesis of primary cultures of 
granulosa tumor cells which expressed natural 
GnRHR. 



25 



5. DETAILED DESCRIPTION OF THE INVENTION 

5.1. PRODUCTION OF GnRH-PE CHIMERIC TOXINS 

While the GnRH-PE chimeric toxins of the present 4 
invention may be produced by chemical synthetic methods or by 
chemical linkage between the two moieties, it is preferred 
that they are produced by fusion of a coding sequence for 
GnRH and a coding sequence for PE under the control of a 
regulatory sequence which directs the expression of the 
fusion polynucleotide in an appropriate host cell (Nechushtan 
et al., 1997, J. Biol. Chem. 272:11597). The fusion of two 

30 coding sequences can be achieved by methods well known in the 
art of molecular biology. The PE coding sequence suitable 
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for use in the present invention, includes but is not limited 
to, full length PE, partial fragments of PE such as domains 
II and/or III of PE, mutated PE in which amino acid residues 
in domain I have been altered to reduce non-specific 
cytotoxicity and mutated PE which has minimal cytotoxic 
5 activities (United States Patent No. 4,892,827, Lorberboum- 
Galski et al., 1990, J. Biol. Chem. 265:16311). 

It is preferred that a fusion polynucleotide contain 
only the AUG translation initiation codon at the 5* end of 
the first coding sequence without tfi'e initiation codori of the 
second coding sequence to avoid the production of two encoded 

10 products. In addition, a leader sequence may be placed at 
the 5 1 end of the polynucleotide in order to target the 
expressed product to a specific site or compartment within a 
host cell to facilitate secretion or subsequent purification 
after gene expression. The two coding sequences can be fused 
directly without any linker or by using a flexible polylinker 

15 composed of the pentamer Gly-Gly-Gly-Gly-Ser (SEQ ID NO: 5) 
repeated 1 to 3 times. Such linker has been used in 
constructing single chain antibodies (scFv) by being inserted 
between V H and V L (Bird et al . , 1988, Science 242:423-426; 

Huston et al., 1988, Proc. Natl. Acad. Sci . U.S.A. 85:5979- 

20 5883) . The linker is designed to enable the correct 

interaction between two beta -sheets forming the variable 
region of the single chain antibody. Other linkers which may 
be used include Glu-Gly-Lys-Ser-Ser-Gly-Ser-Gly-Ser-Glu-Ser- 
Lys-Val-Asp (SEQ ID NO: 6) (Chaudhary et al . , 1990, Proc. 

Natl. Acad. Sci. U.S.A. 87:1066-1070) and Lys-Glu-Ser-Gly- 
25 Ser-Val-Ser-Ser-Glu-Gln-Leu-Ala-Gln-Phe-Arg-Ser-Leu-Asp (SEQ 
ID N0:7) (Bird et al . , 1988, Science 242:423-426). 

5.2. EXPRESSION OF GnRH-PE CHIMERIC TOXINS 

A polynucleotide which encodes a GnRH-PE chimeric 
toxin, mutant polypeptides, biologically active fragments of 

30 

chimeric protein, or functional equivalents thereof, may be 

- 4 - 
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used to generate recombinant DNA molecules that direct the 
expression of the chimeric toxin, mutant polypeptides, 
peptide fragments, or a functional equivalent thereof, in 
appropriate host cells. Due to the inherent degeneracy of 
the genetic code, other DNA sequences which encode 
5 substantially the same or a functionally equivalent amino 
acid sequence, may be used in the practice of the invention 
for the cloning and expression of the chimeric toxin. 

Altered DNA sequences which may be used in accor- 
dance with the invent ion . include deletions, additions, or 
substitutions of different nucleotide residues resulting in a 

10 sequence that encodes the same or a functionally equivalent 
fusion gene product. The gene product itself may contain 
deletions, additions or substitutions of amino acid residues 
within a chimeric sequence, which result in a silent change 
thus producing a functionally equivalent chimeric protein. 
Such amino acid substitutions may be made on the basis of 

15 similarity in polarity, charge, solubility, hydrophobicity, 
hydrophilicity, and/or the amphipathic nature of the residues 
involved. For example, negatively charged amino acids 
include aspartic acid and glutamic acid; positively charged 
amino acids include lysine, histidine and arginine; amino 
acids with uncharged polar head groups having similar 

20 hydrophilicity values include the following: glycine, 

asparagine, glutamine, serine, threonine, tyrosine; and amino 
acids with nonpolar head groups include alanine, valine, 
isoleucine, leucine, phenylalanine, proline, methionine, 
tryptophan . 

The DNA sequences of the invention may be engi- 
25 neered in order to alter a chimeric coding sequence for a 
variety of ends, including but not limited to, alterations 
which modify processing and expression of the gene product. 
For example, mutations may be introduced using techniques 
which are well known in the art, e.g., site-directed 

mutagenesis, to insert new restriction sites, to reduce 

30 ... 

cytotoxicities , etc . 

- 5 - 
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In an alternate embodiment of the invention, the 
coding sequence of the GnRH-PE chimeric toxin could be 
synthesized in whole or in part, using chemical methods well 
known in the art. See, for example, Caruthers et al. # 1980, 
Nuc. Acids Res. Symp. Ser. 7:215-233; Crea and Horn, 180, 

5 

Nuc. Acids Res. 9 (10) :2331; Matteucci and Caruthers, 1980, 

Tetrahedron Letter 21:719; and Chow and Kempe, 1981, Nuc. 

Acids Res. 9 (12) :2807-2817 . In addition, GnRH decapeptide 

and specific domains of PE. : can be synthesized by solid phase 
techniques, cleaved from the resin, and purified by 
10 preparative high performance liquid chromatography followed 
by chemical linkage to form a chimeric toxin (e.g., see 

Creighton, 1983, Proteins Structures And Molecular 

Principles, W.H. Freeman and Co., N.Y. pp. 50-60). The 

composition of the synthetic peptides may be confirmed by 
15 amino acid analysis or sequencing (e.g., the Edman 

degradation procedure; see Creighton, 1983, Proteins, 
Structures and Molecular Principles, W.H. Freeman and Co., 
N.Y., pp. 34-49) . Alternatively, the GnRH and PE produced by 
synthetic or recombinant methods may be conjugated by 
chemical linkers according to methods well known in the art 

20 

(Brinkmann and Pastan, 1994, Biochemica et Biophysica Acta 
1198:27-45) . 

In order to express a biologically active GnRH-PE 
chimeric toxin, the nucleotide sequence coding for a chimeric 
toxin, or a functional equivalent, is inserted into an 
appropriate expression vector, i.e., a vector which contains 

25 

the necessary elements for the transcription and translation 
of the inserted coding sequence. The chimeric toxin as well- 
as host cells or cell lines transfected or transformed with 
recombinant chimeric expression vectors can be used for a 
variety of purposes. These include but are not limited to 
30 generating antibodies (i.e., monoclonal or polyclonal) that 

- 6 - 
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bind to epitopes of the proteins to facilitate their 
purification. 

Methods which are well known to those skilled in 
the art can be used to construct expression vectors 
containing the GnRH-PE chimeric toxin coding sequence and 
5 appropriate transcriptional/translational control signals. 
These methods include in vitro recombinant DNA techniques, 

synthetic techniques and in vivo recombination/genetic 

recombination. See, for example, the techniques described in 
Sairjbrbok et al. # 1989, Molecular Cloning A Laboratory "Manual f 

10 Cold Spring Harbor Laboratory, N.Y. and Ausubel et al., 1989, 

Current Protocols in Molecular Biology, Greene Publishing 
Associates and Wiley Interscience, N.Y. 

A variety of host -expression vector systems may be 
utilized to express the GnRH-PE chimeric protein coding 
sequence. These include but are not limited to 

15 microorganisms such as bacteria transformed with recombinant 
bacteriophage DNA, plasmid DNA or cosmid DNA expression 
vectors containing the chimeric toxin coding sequence; yeast 
transformed with recombinant yeast expression vectors 
containing the chimeric toxin coding sequence; insect cell 
systems infected with recombinant virus expression vectors 

20 ( e.g. , baculovirus) containing the chimeric toxin coding 

sequence; plant cell systems infected with recombinant virus 
expression vectors ( e.g. . cauliflower mosaic virus, CaMV; 
tobacco mosaic virus, TMV) or transformed with recombinant 
plasmid expression vectors ( e.g. , Ti plasmid) containing the 
chimeric toxin coding sequence; or animal cell systems. It 

25 should be noted that since PE normally kills mammalian cells, 
it is preferred that the chimeric toxins of the invention be 
expressed in prokaryotic or lower eukaryotic cells. Section^ 
6 illustrates that GnRH-PE chimeric toxins can be efficiently 
expressed in E. coli. However, since the mutated GnRH-PE 

chimeric toxins in Section 6, infra, do not exhibit cytotoxic 
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activities towards human cells, they may be expressed in 
eukaryotic cells as well. 

The expression elements of each system vary in 
their strength and specificities. Depending on the 
host/vector system utilized, any of a number of suitable 
5 transcription and translation elements, including 

constitutive and inducible promoters, may be used in the 
expression vector. For example, when cloning in bacterial 
systems, inducible promoters such as pL of bacteriophage X, 

... plac, ptrp-, ptac. (ptrp-lac hybrid promoter; cytomegalovirus 
promoter) and the like may be used; when cloning in insect 

10 cell systems, promoters such as the baculovirus polyhedrin 
promoter may be used; when cloning in plant cell systems, 
promoters derived from the genome of plant cells ( e.g. , heat 
shock promoters; the promoter for the small subunit of 
RUBISCO; the promoter for the chlorophyll a/3 binding 
protein) or from plant viruses ( e.g. , the 35S RNA promoter of 

15 CaMV; the coat protein promoter of TMV) may be used; when 

cloning in mammalian cell systems, promoters derived from the 
genome of mammalian cells ( e.g. , metallothionein promoter) or 
from mammalian viruses ( e.g. , the adenovirus late promoter; 
the vaccinia virus 7.5K promoter) may be used; when 
generating cell lines that contain multiple copies of the 

20 chimeric DNA, SV40-, BPV- and EBV-based vectors may be used 
with an appropriate selectable marker. 

In bacterial systems a number of expression vectors 
may be advantageously selected depending upon the use 
intended for the chimeric toxin expressed. For example, when 
large quantities of chimeric toxin are to be produced, 

25 vectors which direct the expression of high levels of protein 
products that are readily purified may be desirable. Such 
vectors include but are not limited to the pHL906 vector 
(Fishman et al . , 1994, Biochem. 33:6235-6243), the E. coli 

expression vector pUR278 (Ruther et al., 1983, EMBO J. 

3 0 2:1791), in which the chimeric protein coding sequence may be 
ligated into the vector in frame with the lacZ coding region 

- 8 - 
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so that a hybrid lacZ protein is produced; pIN vectors 
(Inouye & Inouye, 1985, Nucleic acids Res. 13:3101-3109; Van 
Heeke & Schuster, 1989, J. Biol. Chem. 264:5503-5509); and 
the like. 

An alternative expression system which could be 

5 f 
used to express chimeric toxin is an insect system. In one 

such system, Autographa californica nuclear polyhidrosis 
virus (AcNPV) is used as a vector to express foreign genes. 
The virus grows in Spodoptera frugiperda cells. The chimeric 

toxin coding sequence may be cloned into non-essential 
10 regions (for example the polyhedrin gene) of the virus and 
placed under control of an AcNPV promoter (for example the 
polyhedrin promoter) . Successful insertion of the chimeric 
protein coding sequence will result in inactivation of the 
polyhedrin gene and production of non-occluded recombinant 
virus ( i.e. . virus lacking the proteinaceous coat coded for 
15 by the polyhedrin gene) . These recombinant viruses are then 
used to infect Spodoptera frugiperda cells in which the 

inserted gene is expressed. ( e.g. . see Smith et al., 1983, 

J. Viol. 46:584; Smith, U.S. Patent No. 4,215,051). 

Specific initiation signals may also be required 
for efficient translation of the inserted chimeric toxin 

20 coding sequence. These signals include the ATG initiation 
codon and adjacent sequences. In cases where the entire 
chimeric gene, including its own initiation codon and 
adjacent sequences, is inserted into the appropriate 
expression vector, no additional translational control 
signals may be needed. However, in cases where the chimeric 

25 toxin coding sequence does not include its own initiation 

codon, exogenous translational control signals, including the 
ATG initiation codon, must be provided. Furthermore, the 
initiation codon must be in phase with the reading frame of 
the chimeric protein coding sequence to ensure translation of 
the entire insert. These exogenous translational control 

30 signals and initiation codons can be of a variety of origins, 
both natural and synthetic: - The efficiency of expression may 

- 9 - 
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15 



20 



25 



30 



be enhanced by the inclusion of appropriate transcription 
enhancer elements, transcription terminators, etc. (see 
Bittner et al., 1987, Methods in Enzymol. 153:516-544). 

In addition, a host cell strain may be chosen which 
modulates the expression of the inserted sequences, or^ 
modifies and processes the gene product in the specific 
fashion desired. Such modifications ( e.g. . glycosylation) 
and processing ( e.g. . cleavage) of protein products may be 
important for the function of the protein. Different host 
cells have characteristic and specific mechanisms for the: 
post- trans lational processing and modification of proteins. 
Appropriate cell lines or host systems can be chosen to 
ensure the correct modification and processing of the 
chimeric toxin. To this end, eukaryotic host cells which 
possess the cellular machinery for proper processing of the 
primary transcript, glycosylation, and phosphorylation of the 
chimeric protein may be used. Such mammalian host cells 
include but are not limited to CHO, VERO, BHK, HeLa, COS, 
MDCK, 293, WI38, and the like. 

For long-term, high-yield production of recombinant 
chimeric toxins, stable expression is preferred. For 
example, bacterial host cells or eukaryotic cell lines which 
stably express the chimeric toxins may be engineered. Rather 
than using expression vectors which contain viral origins of 
replication, host cells can be. transformed with a chimeric 
coding sequence controlled by appropriate expression control 
elements ( e.g. . promoter, enhancer, sequences, transcription 
terminators, polyadenylation sites, etc.), and a selectable 
marker. Following the introduction of foreign DNA, 
engineered cells may be allowed to grow for 1-2 days in an 
enriched media, and then are switched to a selective media. 
The selectable marker in the recombinant plasmid confers 
resistance to the selection and allows cells to stably 
integrate the plasmid into their chromosomes and grow to form 
foci which in turn can be cloned and expanded into cell 
lines . 



- 10 - 
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A number of selection systems may be used, 
including but not limited to the herpes simplex virus 
thymidine kinase (Wigler et al . , 1977, Cell 11:223), 
hypoxan thine -guanine phosphor ibosyl trans f erase (Szybalska & 

Szybalski, 1962, Proc. Natl. Acad. Sci. USA 48:2026), and 

5 ' 
adenine phosphoribosyltransf erase (Lowy et al., 1980, Cell 

22:817) genes can be employed in tk", hgprt" or aprt** cells, 
respectively. Also, antimetabolite resistance can be used as 
the basis of selection for dhfr, which confers resistance to 

methotrexate (Wigler et al., 1980, Natl. Acad. Sci. USA 

10 77:3567; O'Hare et a2., 1981, Proc. Natl. Acad. Sci. USA 

78:1527); gpt, which confers resistance to mycophenolic acid 

(Mulligan & Berg, 1981, Proc. Natl. Acad. Sci. USA 78:2072); 
neo, which confers resistance to the aminoglycoside G-418 

(Colberre-Garapin et al., 1981, J. Mol. Biol. 150:1); and 

15 hygro, which confers resistance to hygromycin (Santerre et 

al., 1984, Gene 30:147) genes. Additional selectable genes 

have been described, namely trpB, which allows cells to 

utilize indole in place of tryptophan; hisD, which allows 

cells to utilize histinol in place of histidine (Hartman & 
2Q Mulligan, 1988, Proc. Natl. Acad. Sci. USA 85:8047); and ODC 

(ornithine decarboxylase) which confers resistance to the 
ornithine decarboxylase inhibitor, 2- (dif luoromethyl) -DL- 
ornithine, DFMO (McConlogue L. , 1987, In: Current 
Communications in Molecular Biology, Cold Spring Harbor 
Laboratory ed . ) . 

25 

5.3. PROTEIN PURIFICATION 

The GnRH-PE chimeric toxins of the invention can be 
purified by art -known techniques such as high performance 
liquid chromatography, ion exchange chromatography, gel 
electrophoresis, affinity chromatography and the like. The 
30 actual conditions used to purify each protein will depend, in 
part, on factors such as net charge, hydrophobic ity, 

- 11 - 
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hydrophilicity, etc., and will be apparent to those having 
skill in the art. 

For affinity chromatography purification, any 
antibody which specifically binds GnRH, PE or a 
conformational epitope created by the fusion of GnRH and PE 
5 may be used. For the production of antibodies , various host 
animals, including but not limited to rabbits, mice, rats, 
etc., may be immunized by injection with GnRH-PE chimeric 
toxin or a portion thereof. The protein may be attached to a 
suitable carrier ,_such as bovine serum albumin (BSA>yby- 
means of a side chain functional group or linkers attached to 

10 a side chain functional group. Various adjuvants may be used 
to increase the immunological response, depending on the host 
species, including but not limited to, Freund f s (complete and 
incomplete) , mineral gels such as aluminum hydroxide, surface 
active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, keyhole limpet 

15 hemocyanin, dinitrophenol, and potentially useful human 
adjuvants such as BCG (bacilli Calmette-Guerin) and 
Corynebacterium parvum. 

Monoclonal antibodies to GnRH-PE may be prepared 
using any technique which provides for the production of 
antibody molecules by continuous cell lines in culture. 

20 

These include but are not limited to the hybridoma technique 
originally described by Koehler and Milstein (1975, Nature 
256 -.495-497) . In addition, techniques developed for the 
production of "chimeric antibodies" (Morrison et al., 1984, 

Proc. Natl. Acad. Sci. U.S.A. 81:6851-6855; Neuberger et al., 

25 1984, Nature 312 :604-608; Takeda et al . , 1985, Nature 

314:452-454) by splicing the genes from a mouse antibody 
molecule of appropriate antigen specificity together with 
genes from a human antibody molecule of appropriate 
biological activity can be used. Alternatively, techniques 
described for the production of single chain antibodies (U.S. 
30 Patent No. 4,946,778) can be adapted to produce GnRH-PE- 
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specific single chain antibodies for protein purification and 
detection. 

5.4. CANCER DIAGNOSIS USING GnRH-PE CHIMERIC TOXINS 

The GnRH-PE chimeric toxins of the invention may be 
5 used to detect human tumors in vitro and in vivo. It is 

preferred that such toxins be mutated to abrogate their 
cytotoxic properties without affecting their binding 
specificity for tumor cells. Two examples of such GnRH-PE 
are illustrated in Section 6, infra. The GnRH-PE chimeric 

10 toxins of the invention may be used to detect an epitope 
expressed by a wide variety of human adenocarcinomas, 
including but not limited to, colon adenocarcinoma, breast 
adenocarcinoma, lung adenocarcinoma, ovarian adenocarcinoma, 
endometrial adenocarcinoma, kidney adenocarcinoma, liver 
adenocarcinoma, prostate adenocarcinoma, stomach 

15 adenocarcinoma, cervical adenocarcinoma, gall bladder 

adenocarcinoma and pancreatic adenocarcinoma. The chimeric 
toxins of the invention are particularly useful in 
differentiating adenocarcinomas from non- adenocarcinomas and 
normal cells that express the natural GnRHR. 

2 0 5.4.1. IN VITRO DIAGNOSTIC APPLICATIONS 

The GnRH-PE chimeric toxins of the present 
invention can be used to detect cancer cells in a biological 
specimen such as histological and cytological specimens, and, 
in particular, to distinguish malignant tumors from normal 
tissues and non-malignant tumors for determination of 

25 surgical margin and an improved histological characterization 
of poorly differentiated tumors. Tissue specimens may be 
stained by the chimeric toxins and their binding detected by- 
a secondary antibody specific for a portion of the chimeric 
toxin. The secondary antibody is conjugated to a detectable 
label such as a radioisotope, an enzyme such as peroxidase 

30 and alkaline phosphatase, an ultrasonic probe, a nuclear 
magnetic resonance (NMR) probe, and the like. 
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In addition, immunofluorescence techniques can use 
GnRH-PE to examine human tissue, cell and bodily fluid 
specimens. In a typical protocol, slides containing cryostat 
sections of frozen, unfixed tissue biopsy samples or 
cytological smears are air dried, formalin or acetone fixed, 
5 and incubated with the GnRH-PE in a humidified chamber' at 
room temperature. 

The slides are then washed and further incubated 
with a preparation of a secondary antibody directed against 
GnRH-PE.. The secondary antibody is tagged with a compound 
such as rhodamine, phycoerythrin or fluorescein 
10 isothiocyanate, that fluoresces at a particular wavelength. 
The staining pattern and intensities within the sample are 
then determined by fluorescent light microscopy and 
optionally photographically recorded. 

In another embodiment, computer enhanced 
fluorescence image analysis or flow cytometry can be used to 
15 examine tissue specimens or exfoliated cells, i.e., single 
cell preparations from aspiration biopsies of tumors using 
GnRH-PE. The GnRH-PE chimeric toxins of the invention are 
particularly useful in quantitation of live tumor cells, 
i.e., single cell preparations from aspiration biopsies of 
adenocarcinomas by computer enhanced fluorescence image 
20 analyzer or with a flow cytometer. The percent GnRH-PE-bound 
cell population, alone or in conjunction with determination 
of the DNA ploidy of these cells, may, additionally, provide 
very useful prognostic information by providing an early 
indicator of disease progression. 

The use of GnRH-PE can be extended to the screening 
25 of human biological fluids for the presence of the specific 
antigenic determinants recognized. In vitro 
immunoserological evaluation of biological fluids withdrawn 
from patients thereby permits non- invasive diagnosis of 
cancers. By way of illustration, human bodily fluids such as 

whole blood, pleural effusion fluid, cerebral spinal fluid, 

30 . 

synovial fluid, prostatic fluid, seminal fluid or urine can 
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be taken from a patient and assayed for the specific epitope, 
either as released antigen or membrane -bound on cells in the 
sample fluid, using GnRH-PE in standard radioimmunoassays or 
enzyme-linked immunoassays, competitive binding enzyme-linked 
immunoassays, dot blot or Western blot, or other assays known 
5 in the art. 

Kits containing GnRH-PE can be prepared for in 

vitro diagnosis, prognosis and/or monitoring adenocarcinomas 

by the immunohistological, immunocytological and 

immunoserological methods described above. The components of- 

the kits can be packaged either in aqueous medium or in 

lyophilized form. When the GnRH-PE is used in the kits in 

the form of conjugates in which a label moiety is attached, 

such as an enzyme or a radioactive metal ion, the components 

of such conjugates can be supplied either in fully conjugated 

form, in the form of intermediates or as separate moieties to 

^ be conjugated by the user of the kit. 

A kit may comprise a carrier being 

compartmentalized to receive in close confinement therein one 

or more container means or series of container means such as 

test tubes, vials, flasks, bottles, syringes, or the like. A 

first of said container means or series of container means 

~~ may contain GnRH-PE. A second container means or series of 
20 

container means may contain a label or linker- label 
intermediate capable of binding to GnRH-PE. 

5.4.2. IN VIVO DIAGNOSTIC APPLICATIONS 

GnRH-PE chimeric toxins are also useful for 
25 targeting adenocarcinoma cells in vivo. They can be used for 

tumor localization in the detection and monitoring of primary 
tumors as well as metastases, especially lymph nodes. 
Primary evaluation of the extent of tumor spread may 
influence the choice of therapeutic modalities. Continued 
monitoring of residual tumors may also contribute to better 
30 surveillance and early initiation of salvage therapy. Tagged 
GnRH-PE may also be used intraoperatively for better 
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debulking of a tumor, and minimizes normal tissue destruction 
such as lymph nodes. For these in vivo applications, it is 

preferred that highly purified GnRH-PE be used. 

For in vivo detection and/or monitoring of 

adenocarcinomas, the purified GnRH-PE can be covalently 
attached, either directly or via a linker, to a compound 
which serves as a reporter group to permit imaging of 
specific tissues or organs following administration and 
localization of the conjugates or complexes. A variety of 
different types of substances can serve as the reporter 
group, including such as radiopaque dyes, radioactive metal 
and non-metal isotopes, fluorogenic compounds, fluorescent 
compounds, positron emitting isotopes, non-paramagnetic 
metals, etc. 

Kits for use with such in vivo tumor localization 

methods containing GnRH-PE (or fragments thereof) conjugated 
15 to any of the above types of substances can be prepared. The 
components of the kits can be packaged either in aqueous 
medium or in lyophilized form. When the chimeric toxins are 
used in the kits in the form of conjugates in which a label 
is attached, the components of such conjugates can be 
supplied either in fully conjugated form, in the form of 
20 intermediates or as separate moieties to be conjugated by the 
user of the kit. 

6. EXAMPLE: MUTATED GnRH-PE CHIMERIC TOXINS 

BOUND BUT DIP NOT KILL TUMOR CELLS 

6,1. MATERIALS AND METHODS 

25 6.1.1. CONSTRUCTION OF GnRH-PE CHIMERIC TOXINS 

A plasmid vector carrying a full length PE gene 
{pJY3A1136-l,3) (Chaudhary et al., 1990, J. Biol. Chem. 

265:16306-16310; Neshushtan et al . , 1997, J. Biol. Chem. 

272:11597) was cut with Ndel and Hindi II- A 36 base pair 
3Q (bp) synthetic oligomer flanked by Ndel (5 1 end) and Hindlll 
(3' end) restriction sites was ligated to the vector. This 
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oligomer insert contained a GnRH coding sequence in which the 
encoded amino acid at residue #6 was tryptophan instead of 
glycine. In addition, a sequence encoding a linker Gly-Gly- 
Gly-Gly-Ser (SEQ ID NO: 5) repeated twice was placed between 
the GnRH coding sequence and the PE coding sequence. The 
5 resultant plasmid encoded a chimeric toxin, LGnRH-PE66 [ and 
it was confirmed by restriction endonuclease digestion and 
DNA sequence analysis (Figure 1A and IB) . 

In order to produce a second chimeric toxin, LGnRH- 
PE40, the plasmid vector encoding LG11RH-PE66- was digested 
with Ndel and BamHI and ligated to a Ndel -BamHI 750 bp 
10 fragment obtained from the plasmid PHL-906 (Fishman et al., 

1994, Biochemistry 33:6235-6243) along with the 36 bp 
synthetic oligomer consisting of the GnRH coding sequence 
with tryptophan replacing glycine at the sixth amino acid 
position. A sequence encoding the above linker was again 
placed between the GnRH coding sequence and the PE coding 
15 sequence. The resultant plasmid encoded a chimeric toxin, 
LGnRH-PE40, and it was confirmed by restriction endonuclease 
digestion and DNA sequence analysis (Figure 2) . The toxin 
encoded by this plasmid consisted of domains II and III of 
the full-length PE. 

20 

6.1.2 . GENERATION OF MUTATED 

GnRH-PE CHIMERIC TOXINS 

In order to construct GnRH-PE chimeric toxins that 
were not cytotoxic to human cells, the region in the two 
aforementioned plasmids that encoded 122 amino acids at the 
25 C-terminal end of PE of LGnRH-PE66 and LGnRH-PE40 was excised 
with BamHI and EcoRI and replaced with a corresponding 
fragment which contained a deletion of a single codon 
encoding the amino acid at position 553 of the native- PE 
molecule (Figures 1A, IB and 2) (Fishman et al., 1997, Eur. 

J. Immunol. 27:486; Lukoc et al . , 1988, Infect. Immun. 

30 56:3095). The mutated chimeric toxins are referred to as 
LGnRH-PE66M and LGnRH - P E4 0 M respectively. 
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6.1-3. EXPRESSION OF GnRH-PE CHIMERIC TOXINS 

The plasmids, pVMl and pVM2, encoding the mutated 
GnRH-PE chimeric toxins, LGnRH-PE66M and LGnRH - PE4 0M , 
respectively, were expressed in E. coli strain BL21 (XDE3) . 

The plasmids that encoded LGnRH -PE40 and LGnRH-PE66 were also 
expressed in the same bacteria. The plasmids were 
transferred into E. coli and the cells were grown in medium 

containing ampicillin. After reaching an A6 0O value of 1.5- 
1.7, the cultures were induced at 37°C with 1 mM isopropyl-1- 
thio-3-D-galactopyranoside. The cells were collected by 
^0 centrifugation and the pellet was stored at -70°C for several 
hours . 

A pellet of expressing cells was suspended in lysis 
buffer (50 mM Tris-HCl at pH 8.0, ImM EDTA containing 0.2 
mg/ml lysosyme) , sonicated (three 30 second bursts) and 
centrifuged at 30,000xg for 30 min. The supernatant (soluble 

15 fraction) was removed and kept for analysis. The pellet 

(insoluble fraction) was denatured in extraction buffer (6 M 
guanidinium-HCl, 0.1 M Tris-HCl, pH 8.6, ImM EDTA, 0.05 M 
NaCl, and 10 mM dithiothreitol) and stirred for 30 min at 
4°C. The suspension was cleared by centrifugation at 30000xg 
for 15 min and the pellet discarded. The supernatant was 

20 then dialyzed against 0.1 M Tris-HCl pH 8.0, ImM EDTA, 0.25mM 
NaCl, and 0.25mM L-arginine for 16 hours. The dialyzate was 
centrifuged at 1500 Oxg for 15 min and the resulting 
supernatant (refolding fraction) was used as a source of the 
GnRH-PE chimeric toxins. 

Analysis of the fraction by SDS/PAGE revealed a 

25 major band corresponding to the chimeric toxin. 

Immunoblotting with polyclonal antibodies against PE 
confirmed the production of GnRH-PE chimeric toxins. 

6.1.4. PURIFICATION OF RECOMBINANT 
GnRH-PE CHIMERIC TOXINS 

30 The refolded protein fractions were diluted with 

TE20 buffer (20mM Tris, pH 8.0, ImM EDTA). DEAE Sepharose 
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10 



15 



(Pharmacia, Sweden) was added and stirred for half an hour at 
4°C before being packed into a column. Washing of the column 
was done with 80mM NaCl or 50mM Nacl in TE20 buffer. 
Elution of protein was performed with the linear gradient of 
2 x 200ml of 0.08-0.35M NaCl in TE20 (20mM Tris pH 8.0, ImM 
5 EDTA) buffer. The peak fractions were pooled, dialyzecl 
against phosphate saline buffer and kept in aliquots at 
-20°C. 



6.2. RESULTS 

A recombinant GnRH-PE chimeric toxin, LGnRH-PE66 , 
was produced by fusion of a GnRH coding sequence and a PE 
coding sequence with the insertion of a linker between the 
two moieties. A second GnRH-PE chimeric toxin, LGnRH-PE40, 
was produced in a similar manner except that only domains II 
and III of PE was encoded by the toxin coding sequence. In 
addition, the coding sequences of these two chimeric toxins 
were mutated to result in a single amino acid deletion in the 
PE portion. The mutated chimeric toxins were also expressed 
as recombinant proteins. 

The four GnRH-PE chimeric toxins were purified from 
E* coli lysates and refolded. Since PE kills eukaryotic 

cells by inactivating elongation factor 2 through ADP- 
ribosylation during protein synthesis, the four forms of 
GnRH-PE chimeric toxins were tested in a cell free assay for 
their enzymatic activities in ADP-ribosylation (Chung and 
Collier, 1977, J. Infect. Immun. 16:832-841). While the two 
non-mutated GnRH-PE chimeric toxins, LGnRH-PE40 and LGnRH- 
PE66, exhibited ADP-ribosylation activities, the mutated 
chimeric toxins, LGnRH-PE40M and LGnRH-PE66M, were completely 
inactive in the same assay (Figure 3) . Thus, a single amino _ 
acid substitution in PE abrogated the enzymatic activities of 
the chimeric toxins. 

In addition, all four GnRH-PE chimeric toxins were 
tested for their ability to kill 293 renal adenocarcinoma 



20 



25 



30 

cells. It was shown that only the non-mutated chimeric 
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toxins showed dose -dependent inhibition of protein synthesis 
in the target cells (Figure 4) . However, when the chimeric 
toxins were incubated with the same target cells and their 
binding was detected by a labeled anti-PE antibody and FACS 
analysis, all four toxins were able to bind renal carcinoma 
cells with no binding to control T24A bladder carcinoma 
cells. Therefore, while the mutated GnRH-PE chimeric toxins 
were not able to kill target cells, they retained the ability 
to bind to tumor cells. Such non-cytotoxic chimeric toxins 
are particularly useful for use in cancer diagnosis in vitro 

and in vivo. 

Primary granulosa tumor cells were obtained and 
shown to express GnRHR by PCR using primers corresponding to 
specific portions of the GnRHR. The PCR product in granulosa 
cells was the same size as that obtained from pituitary cells 
which expressed natural GnRHR. In contrast, GnRHR -negative 
cells such as normal human lymphocytes did not produce a 
detectable PCR product. Notwithstanding their expression of 
natural GnRHR, the granulosa cells were not susceptible to 
killing by any of the four GnRH-PE chimeric toxins, 
indicating that the chimeric toxins bind to a new epitope 
expressed by adenocarcinoma cells that is distinct from that 
bound by GnRH itself (Figure 5) . 



The present invention is not to be limited in scope 
by the exemplified embodiments which are intended as 
illustrations of single aspects of the invention and any 
sequences which are functionally equivalent are within the 
scope of the invention. Indeed, various modifications of the 
invention in addition to those shown and described herein 
will become apparent to those skilled in the art from the 
foregoing description and accompanying drawings. Such 
modifications are intended to fall within the scope of the 
appended claims. 
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All publications cited herein are incorporated by 
reference in their entirety. 

5 
10 
15 
20 
25 
30 

- 21 - 



SUBSTITUTE SHEET (RULE 26) 



WO 99/49059 



PCT/1L99/00166 



WHAT IS CLAIMED IS 

1. A method for detecting a tumor cell in a 
biological specimen, comprising contacting the biological 

specimen with a chimeric toxin which comprises gonadotropin 

5 ' 

releasing hormone and Pseudomonas extotoxin A, and detecting 

chimeric toxin-bound cells in the specimen. 

2. The method of Claim 1 in which biological 
specimen contains adenocarcinoma cells. 

10 

3. The method of Claim 2 in which the 
adenocarcinoma cells are selected from a group consisting of 
colon adenocarcinoma, breast adenocarcinoma, lung 
adenocarcinoma, overian adenocarcinoma, endometrial 
adenocarcinoma, kidney adenocarcinoma, liver adenocarcinoma, 
prostate adenocarcinoma, stomach adenocarcinoma, cervical 
adenocarcinoma, gall bladder adenocarcinoma and pancreatic 
adenocarcinoma . 

4 . The method of Claim 1 in which the Pseudomonas 
exotoxin is a full-length toxin. 

20 

5. The method of Claim 1 in which the Pseudomoixae 

exotoxin contains only domains II and III of a full-length 
toxin. 

6 . The method of Claim 1 in which the chimeric 
25 toxin comprises the amino acid sequence as shown in SEQ ID 

N0:2. 

7. The method of Claim 6 in which the chimeric 
toxin is encoded by a polynucleotide which comprises the 
nucleotide sequence as shown in SEQ ID NO:l. 

30 
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8 . The method of Claim 1 in which the chimeric 
toxin comprises the amino acid sequence of SEQ ID NO: 4. 

9 . The method of Claim 8 in which the chimeric 
toxin is encoded by a polynucleotide which comprises the 

5 nucleotide sequence as shown in SEQ ID N0:3. 

10. The method of Claim 1 in which the Pseudomonas 
exotoxin is rendered non- cytotoxic. 

11. The method of Claim 10 in which the 

10 

Pseudomonas exotoxin is rendered non- cytotoxic by deleting an 
amino acid residue. 

12 . The method of Claim 1 in which the chimeric 
toxin comprises the amino acid sequence as shown in SEQ ID 

15 NO: 2 wherein amino acid residue #575 is deleted. 

13. The method of Claim 12 in which the chimeric 
toxin is encoded by a polynucleotide which comprises the 
nucleotide sequence as shown as SEQ ID NO:l wherein 
nucleotides #1822-1824 are deleted. 

20 

14. The method of. Claim 1 in which the chimeric 
toxin comprises the amino acid sequence as shown in SEQ ID 
NO: 4 wherein amino acid residue #336 is deleted. 

15. The method of Claim 14 in which the chimeric 
25 toxin is encoded by a polynucleotide which comprises the 

nucleotide sequence as shown in SEQ ID NO: 3 wherein 
nucleotides #1105-1107 are deleted. 

16. The method of Claim 1 in which the chimeric 
toxin is conjugated to a detectable label. 

30 



- 23 - 



SUBSTITUTE SHEET (RULE 26) 



WO 99/49059 PCT/IL99/001 66 

17. The method of Claim 16 in which the detectable 
label is a radioisotope, a fluorescent dye, an enzyme, an 
ultrasonic probe or a NMR probe. 



18. The method of Claim 1 in which the biological 
specimen is a biopsy specimen. 

19. The method of Claim 1 in which the biological 
specimen is a bodily fluid. 

20. The method of Claim 19 in which the bodily 
fluid is whole blood. 



21. The method of Claim 19 in which the bodily 
fluid is pleural effusion fluid. 

22 . The method of Claim 19 in which the bodily 
fluid is urine. 



23 . A method of detecting a tumor cell in a human 
subject, comprising administering to the subject a chimeric 
toxin which comprises gonadotropin releasing hormone and 
Pseudomonas exotoxin A, and detecting chimeric toxin-bound 

cells in the subject. 

24. The method of Claim 23 in which the subject 
has adenocarcinoma. 



25. The method of Claim 24 in which the 
adenocarcinoma is selected from a group consisting of colon 
adenocarcinoma, breast adenocarcinoma, lung adenocarcinoma, 
overian adenocarcinoma, endometrial adenocarcinoma, kidney 
adenocarcinoma, liver adenocarcinoma, prostate 
adenocarcinoma, stomach adenocarcinoma, cervical 
adenocarcinoma, gall bladder adenocarcinoma and pancreatic 
adenocarcinoma . 
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26. The method of Claim 1 in which the Pseudomonas 
exotoxin is rendered non- cytotoxic. 

27. The method of Claim 26 in which the 
Pseudomonas exotoxin is rendered non- cytotoxic by deleting an 

5 

amino acid residue. 

28. The method of Claim 1 in which the chimeric 
toxin comprises the amino acid sequence as shown in SEQ ID 
HO: 2 wherein amino acid residue #57S is deleted. 

10 

29. The method of Claim 28 in which the chimeric 
toxin is encoded by a polynucleotide which comprises the 
nucleotide sequence as shown in SEQ ID NO:l wherein 
nucleotides #1822-1824 are deleted. 



30. The method of Claim 1 in which the chimeric 
toxin comprises the amino acid sequence as shown in SEQ ID 
NO:4 wherein amino acid residue #336 is deleted. 



31. The method of Claim 30 in which the chimeric 
toxin i3 encoded by a polynucleotide which comprises the 
nucleotide sequence as shown in SEQ ID NO: 3 wherein 
nucleotides #1105-1107 are deleted. 

32. The method of Claim 23 in which the chimeric 
toxin is conjugated to a detectable label. 

„ 33. The method of Claim 32 in which the detectable 

25 

label is a radioisotope, a fluorescent dye, an enzyme, an 
ultrasonic probe or a NMR probe. 

34. A chimeric toxin comprising gonadotropin 
releasing hormone and Pseudomonas exotoxin A, wherein the 

30 toxin binds but does not kill tumor cells. 
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35. The chimeric toxin of Claim 34 which comprises 
the amino acid sequence as shown in SEQ ID NO: 2 wherein the 
amino acid residue #575 is deleted. 

36. The chimeric toxin of Claim 35 which is 

5 encoded by a polynucleotide which comprises the nucleotide 
sequence as shown in SEQ ID NO:l wherein nucleotides #1822- 
1824 are deleted. 



37. The chimeric toxin of Claim 34 which comprises 
the amino acid sequence as shown in SEQ ID NO: 4 wherein the 
10 amino acid residue #336 is deleted. 



38. The chimeric toxin of Claim 37 which is 
encoded by a polynucleotide which comprises the nucleotide 
sequence as shown in SEQ ID NO: 3 wherein nucleotides #1105- 
1107 are deleted. 



20 



25 



30 
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1/7 

100/1 130/11 

ATG gog coc tgg Ice tot tgg ctg cgc cct ggo goo get ggo ggo ggo ggo lec ggo ggo 
MET glu his trp ser tyr trp leu org pro gly glu olo gly gly gly gly ser gly gly 
160/21 190/31 

ggo ggo tec ggt coo get ttc goc etc tgg ooc goo tgc gee ooo gec tgc gtg etc goc 
gly gly ser gly gin olo phe osp leu trp osn glu cys olo lys olo cys vol leu osp 
220/41 250/51 

etc oog goc ggc gtg cgt tec ogc cgc otg ogc gtc goc ccg gee otc gee goc occ ooc 
leu lys osp gly vol org ser ser org met ser vol osp pro olo ile olo osp thr osn 
280/61 310/71 

ggc cog ggc gtg ctg coc toe tec otg gtc ctg gog ggc ggc ooc goc gcg etc gog ctg 
gly gin gly vol leu his tyr ser met vol leu glu gly gly osn osp olo leu glu leu 
340/81 370/91 

gee otc goc ooc gee etc ogc otc occ ogc goc ggc ctg occ otc cgc etc goo ggc ggc 
olo ile osp osn olo leu ser ile thr ser asp gly leu thr ile org leu glu gly gly 
400/101 430/111 

gtc gog ccg ooc oog ccg ctg cgc toe ogc- toe ocg cgc cog gcg cgc ggc ogg tgg teg 
vol glu pro osn lys pro leu org tyr ser tyr thr org gin olo org gly org trp ser 
460/121 490/131 

ctg ooc tgg ctg gta ccg otc ggc coc gog oog ccc teg ooc otc aog gtg ttc otc cac 
leu osn trp leu vol pro ile gly his glu lys pro ser osn ile lys vol phe ile his 
520/141 550/151 

goo ctg oac gee ggc ooc cog etc ogc coc atg teg ccg otc tac occ otc gag otg ggc 
glu leu asn alo gly osn gin leu ser his met ser pro ile tyr thr ile glu met gly 
580/161 610/171 

goc gog ttg ctg gcg aag ctg gcg cgc got gee occ ttc ttc gtc agg gcg coc gag ogc 
asp glu leu leu ola lys leu alo org osp olo thr phe phe vol org olo his glu ser 
640/181 670/191 

aac gog otg cog ccg ocg etc gee ate ogc cat gee ggg gtc ogc gtg gtc otg gee cog 
osn glu met gin pro thr leu ala ile ser his ola gly vol ser vol vol met olo gin 
700/201 730/211 

ace cag ccg cgc egg gao aog cgc tgg ogc gaa tgg gee ogc ggc aog gtg ttg tgc ctg 
thr gin pro org org glu lys org trp ser glu trp olo ser gly lys vol leu cys leu 
760/221 790/231 

etc goc ccg ctg goc ggg gtc toe oac toe etc gec cag coo cgc tgc oac etc goc got 
leu osp pro leu osp gly vol tyr asn tyr leu ala gin gin org cys osn leu osp osp 
820/241 850/251 

ace tgg gao ggc oag otc toe egg gtg etc gec ggc oac ccg gcg aag cat goc ctg goc 
thr trp glu gly lys ile tyr org vol leu olo gly asn pro olo lys his osp leu osp 
880/261 910/271 

otc oao ccc ocg gtc otc ogt goo gog ctg gog ttt ccc gog ggc ggc age ctg gec gcg 
ile lys pro thr vol ile ser glu glu leu glu phe pro glu gly gly ser leu ala olo 
940/281 970/291 

ctg occ gcg cac cog get tgc coc ctg ccg ctg gag act ttc acc cgt cot cgc cog ccg 
leu thr alo his gin olo cys his leu pro leu glu thr phe thr org his org gin pro 

FIG.1A 
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1000/301 

cgc ggc tgg gao 

org gly trp glu 

1060/321 

ctg gcg gcg egg 

leu alo ata org 

1120/341 

ccc ggc age ggc 

pro gly ser gly 

1180/361 

gec ctg occ ctg 

olo leu thr leu 

1240/381 

gag gec ggc gcg 

glu olo gly olo 

1300/401 

tgc gcg ggc ccg 

cys olo gly pro 

1360/421 

gog ttc etc ggc 

glu phe leu gly 

1420/441 

gtg gog egg ctg 

vol glu org leu 

1480/461 

toe coc ggc occ 

tyr his gly thr 

1540/481 

ogc cog goc etc 

ser gin osp leu 

1600/501 

toe ggc toe gec 

tyr gly tyr olo 

1660/521 

ctg egg gtc tot 

leu org vol tyr 

1720/541 

gee gcg ccg gog 

olo olo pro glu 

1780/561 

ctg goc gee otc 

leu osp olo ile 

1840/581 

ccg ctg' gee gog 

pro leu olo glu 
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ggc tot ccg gtg 
gly tyr pro vol 

1090/331 
gtc goc cog gtg 
vol osp gin vol 

1150/351 
gcg otc cgc gag 
olo ile org glu 

1210/371 
gog cgc ttc gtc 
glu org phe vol 

1270/391 
gtg ogc ctg occ 
vol ser leu thr 

1330/411 
gee ctg ctg gog 
olo leu leu glu 

1390/431 
ogc ttc ogc occ 
ser phe ser thr 

1450/451 
coo ctg gag gag 
gin leu glu glu 

1510/471 
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940/281 

CTG CTG CGG GTC 
leu leu org vol 
1000/301 
CTG GCC GCG CCG 
leu olo olo pro 

1060/321 

CGC CTG GAC GCC 

org leu osp olo 

1120/341 

TGG CCG CTG GCC 

trp pro leu olo 

1180/361 

GTC GGC GGC GAC 
vol gly gly asp 
1240/381 

CCG GAC TAC GCC 
pro osp tyr olo 
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thr vol vol 
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CCC GGC AAA 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION : 

(i) APPLICANT: Yissum Research Development Company of The Hebrew 
University of Jerusalem 

(ii) TITLE OP INVENTION: METHODS OF CANCER DIAGNOSIS 

USING A CHIMERIC TOXIN 

(iii) NUMBER OP SEQUENCES: 7 

tiv) CORRESPONDENCE ADDRESS; 

(A) ADDRESSES: Pennie * Edmonds, LLP 

(B) STREET: XX55 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: NY 

(E) COUNTRY: USA 
(P) ZIP: 10036-2S11 

(v) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Diskette 
(b) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 

(D) SOFTWARE: FastSEQ for Windows Version 2.0b 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 

(B> FILING DATE : 
(C) CLASSIFICATION ; 

(vii) PRIOR APPLICATION DATAt 

(A) APPLICATION NUMBER x 09/046,992 

(B) FILING DATE: 24 -MAR- 199 8 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Poissant, Brian M 

(B) REGISTRATION NUMBER i 28,462 

(C) REFERENCE/DOCKET NUMBER: 9457-0013-228 

(iX) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 650-493-4935 
<B) TELEFAX: 650-493-5556 
(C) TELEX; 66141 PENNXS 



(2) INFORMATION FOR 8EQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTH: 1908 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: Single 



WO 99/49059 

(D) TOPOLOGY: linear 



PCT/IL99/00166 



(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 1. . .1905 
(D) OTHER INFORMATION : 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: 

ATG SAG CAC TGG TCC TAT TOO CTG CGC OCT GGA GAA GCT GGA GGA GGA 48 
Met Glu His Trp Ser Tyr Trp Leu Arg Pro Gly Glu Ala Gly Gly Gly 
15 10 15 

GGA TCC GGA GGA GGA GGA TCC GGT CAA GCT TTC GAC CTC TGG AAC GAA 96 
Gly Ser Gly Gly Gly Gly Ser Gly Gin Ala Phe Asp Leu Trp Asn Glu 
20 25 30 

TCC GCC AAA GCC TGC GTG CTC GAC CTC AAG GAC GGC GTG CGT TCC AGO 144 
Cys Ala Lys Ala cys Val Leu Asp Leu Lys Asp Gly val Arg Ser Ser 
35 40 45 

CGC ATG AGC GTC GAC CCG GCC ATC GCC GAC ACC AAC GGC CAG GGC GTG 192 
Arg Met ser Val Asp Pro Ala lie Ala Asp Thr Asn Gly Gin Gly val 
50 55 €0 

CTG CAC TAC TCC ATG GTC CTG GAG GGC GGC AAC GAC GCG CTC GAG CTG 240 
Leu His Tyr ser Met Val Leu Glu Gly Gly Asn Asp Ala Leu Glu Leu 
65 70 75 80 

GCC ATC GAC AAC GCC CTC AGC ATC ACC AGC GAC GGC CTG ACC ATC CGC 288 
Ala lie Asp Asn Ala Leu Ser lie Thr Ser Asp Gly Leu Thr lie Arg 

85 90 > 95 

CTC GAA GGC GGC GTC GAG CCG AAC AAG CCG CTG CGC TAC AGC TAC ACG 336 
Leu Glu Gly Gly Val Glu Pro Asn Lys Pro Leu Arg Tyr $«r Tyr Thr 
100 10S 110 

CGC CAG GCG CGC GGC AGO TGG TCG CTG AAC TGG CTG GTA CCG ATC GGC 384 
Arg Gin Ala Arg Gly Arg Trp Ser Leu Asn Trp Leu Val Pro He Gly 
115 120 125 

CAC GAG AAG CCC TCG AAC ATC AAG GTG TTC ATC CAC GAA CTG AAC GCC 432 
His Glu Lys Pro Ser Asn He Lys Val Phe He His Glu Leu Asn Ala 
130 135 140 

GGC AAC CAG CTC AGC CAC ATG TCG CCG ATC TAC ACC ATC GAG ATG GGC 480 
Gly Asn Gin Leu Ser His Met Ser pro He Tyr Thr He Glu Met Gly 
145 150 155 160 

GAC GAG TTG CTG GCG AAG CTG GCG CGC GAT GCC ACC TTC TTC GTC AGG 528 
Asp Glu Leu Leu Ala Lys Leu Ala Arg Asp Ala Thr Phe Phe Val Arg 

165 170 175 

GCG CAC GAG AGC AAC GAG ATG CAG CCG ACG CTC GCC ATC AGC CAT GCC 576 
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Ala His Glu Ser Asn Glu Met Gin Pro Thr Leu Ala He Ser His Ala 
180 185 190 

GGG GTC AGC GTQ GTC ATG GCC CAG AAC CAG CCG CGC CGG GAA AAG CGC 624 
Gly val Ser Val val Met Ala Gin Asn Gin Pro Arg Arg Glu Lys Arg 
195 200 205 

TGG AGC GAA TGG GCC AGC GGC AAG GTG TTG TGC CTG CTC GAC CCG CTG 672 
Tip Ser Glu Trp Ala ser Gly Lye Val Leu Cys Leu Leu Asp Pro Leu 
210 215 220 

GAC GGG GTC TAC AAC TAC CTC GCC CAG CAA CGC TGC AAC CTC GAC GAT 720 
Asp Gly Val Tyr Asn Tyr Leu Ala Gin Gin Arg Cys Asn Leu Asp Asp 
225 230 235 240 

ACC TGG GAA GGC AAG ATC TAC CGG GTG CTC GCC GGC AAC CCG GCG AAG 768 
Thr Trp Glu Gly Lys He Tyr Arg Val Leu Ala Gly Asn Pro Ala Lys 

245 250 255 

CAT GAC CTG GAC ATC AAA CCC ACG GTC ATC AGT GAA GAG CTG GAG TTT 816 
His Asp Leu Asp He Lys Pro Thr Val He Ser Glu Glu Leu Glu Phe 
260 265 270 

CCC GAG GGC GGC AGC CTG GCC GCG CTG ACC GCG CAC CAG GCT TGC CAC 864 
pro Glu Gly Gly ser Leu Ala Ala Leu Thr Ala Kis Gin Ala Cys His 
275 * 280 285 

CTG CCG CTG GAG ACT TTC ACC CGT CAT CGC CAG CCG CGC GGC TGG GAA 912 
Leu Pro Leu Glu Thr Phe Thr Arg His Arg Gin Pro Arg Gly Trp Glu 
290 295 300 

CAA CTG GAG CAG TGC GGC TAT CCG GTG CAG CGG CTG GTC GCC CTC TAC 960 
Gin Leu Glu Gin Cys Gly Tyr pro Val Gin Arg Leu Val Ala Leu Tyr 
305 310 315 320 

CTG GCG GCG CGG CTG TCG TGG AAC CAG GTC GAC CAG GTG ATC CGC AAC 1008 
Leu Ala Ala Arg Leu Ser Trp Asn Gin Val Aap Gin Val IX© Arg Asn 

325 330 335 

GCC CTG GCC AGC CCC GGC AGC GGC GGC GAC CTG GGC GAA GCG ATC CGC 1056 
Ala Leu Ala Ser Pro Gly Ser Gly Gly Asp Leu Gly Glu Ala He Arg 
340 345 350 

GAG CAG CCG GAG CAG GCC CGT CTG GCC CTG ACC CTG GCC GCC GCC GAG 1104 
Glu Gin pro Glu Gin Ala Arg Leu Ala Leu Thr Leu Ala Ala Ala Glu 
355 360 365 

AGC GAG CGC TTC GTC CGG CAG GGC ACC GGC AAC GAC GAG GCC GGC GCG 1152 
Ser Glu Arg Phe Val Arg Gin Gly Thr Gly Aen Asp Glu Ala Gly Ala 
370 375 380 

GCC AAC GCC GAC GTG GTG AGC CTG ACC TGC CCG GTC GCC GCC GGT GAA 1200 
Ala Asn Ala Asp Val Val Ser Leu Thr Cys Pro Val Ala Ala Gly Glu 
385 390 395 400 

TGC GCG GGC CCG GCG GAC AGC GGC GAC GCC CTG CTG GAG GCG AAC TAT 1248 
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Cys Ala Gly Pro Ala Asp Ser Gly Asp Ala Leu Leu Glu Ala Asn Tyr 

405 410 415 

CCC ACT GGC GCG GAG TTC CTC GGC GAC GGC GGC GAC GTC AGC TTC AGC 1296 
Pro Thr Gly Ala Glu Phe Leu Gly Asp Gly Gly Asp Val Ser Phe Ser 
420 42S 430 

ACC CGC GGC ACG CAG AAC TGG ACG GTG GAG CGG CTG CTC CAG GCG CAC 1344 
Thr Arg Gly Thr Gin Asn Trp Thr Val Glu Arg Leu Leu Gin Ala Hie 
435 440 445 

CGC CAA CTG GAG GAG CGC GGC TAT GTG TTC GTC GGC TAG CAC GGC ACC 1392 
Arg Gin Leu Glu Glu Arg Gly Tyr Val Phe Val Gly Tyr His Gly Thr 
450 455 460 

TTC CTC GAA GCG GCG CAA AGC ATC GTC TTC GGC GGG GTG CGC GCG CGC 1440 
Phe Leu Glu Ala Ala Gin Ser lie Val Phe Gly Gly Val Arg Ala Arg 
465 470 47S 480 

AGC CAG GAC CTC GAC GCG ATC TGG CGC GGT TTC TAT ATC GCC GGC GAT 1488 
Ser Gin Asp Leu Asp Ala lie Trp Arg Gly Phe Tyr lie Ala Gly Asp 

485 490 495 

CCG GCG CTG GCC TAG GGC TAC GCC CAG GAC CAG GAA CCC GAC GCA CGC 1536 
Pro Ala Leu Ala Tyr Gly Tyr Ala Gin Asp Gin Glu Pro Asp Ala Arg 
500 505 510 

GGC CGG ATC CGC AAC GGT GCC CTG CTG CGG GTC TAT GTG CCG CGC TCG 1584 
Gly Arg He Arg Asn Gly Ala Leu Leu Arg Val Tyr Val Pro Arg ser 
515 520 525 

AGC CTG CCG GGC TTC TAC CGC ACC AGC CTG ACC CTG GCC GCG CCG GAG 1632 
Ser Leu Pro Gly Phe Tyr Arg Thr Ser Leu Thr Leu Ala Ala Pro Glu 
530 535 540 

GCG GCG GGC GAG GTC GAA CGG CTG ATC GGC CAT CCG CTG CCG CTG CGC 1680 
Ala Ala Gly Glu Val Glu Arg Leu Jla Oly Hi a P*o Lou Pro Leu Arg 
545 550 555 560 

CTG GAC GCC ATC ACC GGC CCC GAG GAG GAA GGC GGG CGC CTG GAG ACC 1728 
Leu Asp Ala lie Thr Gly Pro Glu Glu Glu Gly Gly Arg Leu Glu Thr 

565 570 575 

ATT CTC GGC TGG CCG CTG GCC GAG CGC ACC GTG GTG ATT CCC TCG GCG 1776 
He Leu Gly Trp Pro Leu Ala Glu Arg Thr Val Val He Pro Ser Ala 
580 585 590 

ATC CCC ACC GAC CCG CGC AAC GTC GGC GGC GAC CTC GAC CCG TCC AGC 1B24 
He Pro Thr Asp Pro Arg Asn Val Gly Gly Asp Leu Asp Pro Ser Ser 
595 600 605 

ATC CCC GAC AAG GAA CAG GCG ATC AGC GCC CTG CCG GAC TAC GCC AGC 1872 
He Pro Asp Lye Glu Gin Ala He Ser Ala Leu Pro Asp Tyr Ala Ser 
610 615 620 

CAG CCC GGC AAA CCG CCG CGC GAG GAC CTG AAG TAA 1906 
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Gin Pro Gly Lye Pro Pro Arg Glu Asp Leu Lys 
*25 630 635 



(2) INFORMATION FOR SSQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 635 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPB: internal 

(xi) SEQUENCE DESCRIPTION: SSQ ID NO: 2: 

Met Glu His Trp Ser Tyr Trp Leu Arg Pro Gly Glu Ala Gly Gly Gly 

1 5 io 15 

Gly Ser Gly Gly Gly Gly Ser Gly Cln Ala Phe Asp Leu Trp Asn Glu 

20 25 30 

Cys Ala Lys Ala Cys val Leu Asp Leu Lys Asp Gly val Arg Ser Ser 

35 40 45 

Arg Met Ser Val Asp Pro Ala He Ala Asp Thr Asn Gly Gin Gly Val 

50 55 60 

Leu His Tyr Ser Met Val Leu Gl u Gly Gly Asn Asp Ala Leu Glu Leu 
65 70 75 80 

Ala He Asp Asn Ala Leu Ser lie Thr Ser Asp Gly Leu Thr He Arg 

85 90 95 

Leu Glu Gly Gly Val Glu Pro Asn Lys Pro Leu Arg Tyr Ser Tyr Thr 

ioo 105 110 

Arg Gin Ala Arg Gly Arg Trp Ser Leu Asn Trp Leu val Pro He Gly 

115 120 125 

His Glu Lys Pro Ser Asn He Lys Val Phe He His Glu Leu Asn Ala 

130 135 140 

Gly Asn Gin Leu Ser His Met Ser Pro He Tyr Thr He Glu Met Gly 

145 150 153 160 

Asp Glu Leu Leu Ala Lys Leu Ala Arg Asp Ala Thr Phe Phe val Arg 

165 170 175 

Ala His Glu Ser Asn Glu Met Gin pro Thr Leu Ala He ser His Ala 

ISO 185 190 

Gly Val Ser Val Val Met Ala Gin Asn Gin Pro Arg Arg Glu Lys Arg 

135 200 205 

Trp Ser Glu Trp Ala Ser Gly Lys Val Leu Cys Leu Leu Asp Pro Leu 

210 215 220 

Asp Gly Val Tyr Asn Tyr Leu Ala Gin Gin Arg Cys Asn Leu Asp Asp 
225 230 235 240 

Thr Trp Glu Gly Lys He Tyr Arg Val Leu Ala Gly Asn Pro Ala Lys 

245 250 255 

His Asp Leu Asp He Lys Pro Thr Val He Ser Glu Glu Leu Glu Phe 

260 265 270 

Pro Glu Gly Gly Ser Leu Ala Ala Leu Thr Ala His Gin Ala Cys His 

275 280 285 

Leu Pro Leu Glu Thr Phe Thr Arg His Arg Gin pro Arg Gly Trp Glu 

290 295 300 

Gin Leu Glu Gin Cys Gly Tyr Pro Val Gin Arg Leu Val Ala Leu Tyr 
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1 AC 














Wu iua /US ATy 


T . Bar T |"¥S A on ft! Tl Val hays 
li«U 361 l*p A3u VXU Vol /iSJp 


Gin 


val 




Arg 




Ala Leu Ala Ser 


Pro Gly Ser Gly Gly Asp Leu 


Gly 


Glu 


Ala 


He 


Arg 


340 


"lit R 

349 












fcxXU VzXU *TO UXU 


Uin Ala ATy liBU J bau xuf 






Ala. 


Ala 


Glu 
















oer i^iu ATy 


vox Jiiy Uin v»xy xIjlc vXy ash 


nop 


um 


Aid 




A! A 


"4 *T A 

370 


375 


Sou 










Ala Asn Ala Asp 


vax vax ser xieu Tnr uys vro 




111 sa 

Axa 


Axa 


wxy 


UJ.U 


m pm p 

385 


390 395 










400 


Cys Ala Qly Pro 


Ala Asp Ser Gly Asp Ala Leu 
405 410 


l»eu 


Ml - . 


Axa 


ACTS 

415 


Tyr 


Pro Thr Gly Ala 


Glu Phe Leu Gly Asp Gly Gly 


Asp 


Val 


Ser 


Phe 


Ser 


420 


425 






430 






Thr Arg Gly Thr 


Gla Asn Trp Thr Val Glu Arg 


Leu 


Leu 


Gin 


n 1 _ 

Ala 


Hxs 


435 


440 




445 








Arg Gin Leu Glu 


Glu Arg Gly Tyr Val Phe Val 


Gly 


Tyr 


His 


Gly 


Thr 


450 


455 


460 










Phe i#eu Glu Ala 


Ala Gin Ser lie Val Phe Gly 


Gly 


Vax 


Arg 


Axa 


Arg 


465 


470 475 










480 


Ser Gin Asp Leu 


Asp Ala lie Trp Arg Gly Phe 
485 490 


mM ■ ■ , 

Tyr 


lie 


Ala 


Gly 
495 


Asp 


Pro Ala Leu Ala 


Tyr Gly Tyr Ala Gin Asp Gin 


Glu 


Pro 


Asp 


Ala 


Arg 


500 


505 






510 






Gly Arg lie Arg 


Asn v5xy Axa xveu lieu Arg vax 


W>i»»« 

Tyr 


vax 


rtv 




per 


515 


520 




C*J It 








ser lieu pro uiy 


Pne Tyr Arg; Tnr ser x«eu rnr 


T m* 

XicU. 


211 A 
AJ.4* 


Hla 


Pro 


VI1U 


530 


535 


540 










Ala Ala Gly Glu 


Val Glu Arg Leu He Gly His 


pro 


xveu 


Fro 


X»eu 


Arg 


545 


550 555 












Leu Asp Ala lie 


Thr Gly Pro Glu Glu Glu Gly 
565 570 


Gly 


Arg 


Leu 


Glu 
575 


Thr 


lie Leu Gly Trp 


Pro Leu Ala Glu Arg Thr Val 


Val 


lie 


Pro 


Ser 


Ala 


seo 


585 






590 






lie Pro Thr Asp 


Pro Arg Asn val Gly Gly Asp 


Leu 


Asp 


Pro 


Ser 


Ser 


595 


600 




605 








lie Pro Aep Lys 


Glu Gin Ala He Ser Ala Leu 


Pro 


ASP 


Tyr 


Ala 


Ser 


610 


615 


620 











Gin Pro Gly Lys Pro Pro Arg Glu Asp Leu Lys 
625 €30 635 



(2) INFORMATION FOR SKQ TO NOi3i 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1191 base pairs 

(B) TYPE: nucleic acid 

(C) STRAOTEDNESS: Single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY: Coding Sequence 

(B) LOCATION i 1...1188 
(D) OTHER INFORMATION: 
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(XX ) SEQUENCE DESCRIPTION • SEQ ID NO : 3 : 

ATG GAG CAC TGG TCC TAT TOG CTG CGC OCT GGA GAA GCT GGA GGA GGA 48 
Met Glu His Trp Ser Tyr Trp Leu Arg Pro Gly Glu Ala Gly Gly Gly 
15 10 15 

GGA TCC GGA GGA GGA GGA TCC GCT GAA GCT TTT GTT AAC GCC CAT ATG 96 
Gly Ser Gly Gly Gly Gly Ser Gly Gin Ala Phe Val ash Ala His Met 
20 25 30 

GCC GAA GAG GCC GGC AGC CTG GCC GCG CTG ACC GCG CAC CAG GCT TGC 144 
Ala Glu Glu Gly Gly Ser Leu Ala Ala Leu Thr Ala His Gin Ala Cys 
35 40 45 

CAC CTG CCG CTG GAG ACT TTC ACC CGT CAT CGC CAG CCG CGC GGC TGG 192 
His Leu Pro Leu Glu Thr Phe Thr Arg His Arg Gin Pro Arg Gly Trp 
50 55 60 

GAA CAA CTG GAG CAG TGC GGC TAT CCG CTG CAG CGG CTG GTC GCC CTC 240 
Glu Glu Leu Glu Gin Cys Gly Tyr Pro val Gin Arg Leu Val Ala Leu 
65 70 75 80 

TAC CTG GCG GCG CGG CTG TCG TGG AAC CAG GTC GAC CAG GTG ATC CGC 208 
Tyr Leu Ala Ala Arg Leu ser Trp Asn Gin Val Asp Gin val He Arg 

85 90 95 

AAC GCC CTG GCC AGC CCC GGC AGC GGC GGC GAC CTG GGC GAA GCG ATC 336 
Asn Ala Leu Ala Ser Pro Gly Ser Gly Gly Asp Leu Gly Glu Ala lie 
100 105 110 

CGC GAG CAG CCG GAG CAG GCC CGT CTG GCC CTG ACC CTG GCC GCC GCC 384 
Arg Glu Gin Pro Glu Gin Ala Arg Leu Ala Leu Thr Leu Ala Ala Ala 
115 120 125 

GAG AGC GAG CGC TTC GTC CGG CAG GGC ACC GGC AAC GAC GAG GCC GGC 432 
Glu Ser Glu Arg Phe Val Arg Gin Gly Thr Gly Asn Asp Glu Ala Gly 

130 135 140 

GCG GCC AAC GCC GAC GTG GTG AGC CTG ACC TGC CCG GTC GCC GCC GGT 480 
Ala Ala Asn Ala Asp Val Val Ser Leu Thr Cys Pro Val Ala Ala Gly 
145 150 155 160 

GAA TGC GCG GGC CCG GCG GAC AGC CGC GAC GCC CTG CTG GAG CGC AAC 528 
Glu Cys Ala Gly Pro Ala Asp Ser Gly Asp Ala Leu Leu Glu Arg Asn 

165 170 175 

TAT CCC ACT GGC GCG GAG TTC CTC GGC GAC GGC GGC GAC GTC AGC TTC 576 
Tyr Pro Thr Gly Ala Glu Phe Leu Gly Asp Gly Gly Asp Val Ser Phe 
180 185 150 

AGC ACC CGC GGC ACQ CAG AAC TGG ACQ GTG GAG CGG CTG CTC CAG GCG 624 
Ser Thr Arg Gly Thr Gin Asn Trp Thr Val Glu Arg Leu Leu Gin Ala 
195 200 205 

CAC CGC CAA CTG GAG GAG CGC GGC TAT GTG TTC GTC GGC TAC CAC GGC 672 
His Arg Gin Leu Glu Glu Arg Gly Tyr Val Phe Val Gly Tyr His Gly 
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ACC TTC CTC GAA GCG GCG CAA AGC ATC GTC TTC GGC GGG GTG CGC GCG 720 
Thr Phe Leu Glu Ala Ala Gin Ser He Val Phe Gly Gly Val Arg Ala 
225 230 235 240 

CGC AGC CAG GAC CTC GAC GCG ATC TGG CGC GGT TTC TAT ATC GCC GGC 768 
Arg Ser Gin Asp Leu Asp Ala lie Trp Arg Gly Phe Tyr lie Ala Gly 

245 250 255 

GAT CCG GCG CTG GCC TAC GGC TAC GCC CAG GAC CAG GAA CCC GAC GCA 816 
Asp Pro Ala Leu Ala Tyr Gly Tyr Ala Gin Asp Gin Glu Pro Asp Ala 
260 265 270 

CGC GGC CGG ATC CGC AAC GGT GCC CTG CTG CGG GTC TAT GTG CCG CGC 864 
Arg Gly Arg lie Arg Asn Gly Ala Leu Leu Arg val Tyr Val Pro Arg 
275 280 285 

TCG AGC CTG CCO GGC TTC TAC CGC ACC AGC CTG ACC CTG GCC GCG CCG 912 

Ser ser Leu Pro Gly Phe Tyr Arg Thr Ser Leu Thr Leu Ala Ala Pro 
290 295 300 

GAG GCG GCG GGC GAG GTC GAA CGG CTG ATC GGC CAT CCG CTG CCG CTG 960 
Glu Ala Ala Gly Glu Val Glu Arg Leu He Gly His Pro Leu Pro Leu 

305 310 315 320 

CGC CTG GAC GCC ATC ACC GGC CCC GAG GAG GAA GGC GGG CGC CTG GAG 1008 
Arg Leu Asp Ala lie Thr Gly Pro Glu Glu Glu Gly Gly Arg Leu Glu 

325 330 335 

ACC ATT CTC GGC TGG CCG CTG GCC GAG CGC ACC GTQ CTG ATT CCC TCG 1056 
Thr lie Leu Gly Trp Pro Leu Ala Glu Arg Thr Val Val He Pro Ser 
340 345 350 

GCG ATC CCC ACC GAC CCG CGC AAC GTC GGC GGC GAC CTC GAC CCG TCC 1104 
Ala lie Pro Thr Asp Pro Arg Asn Val Gly Gly Asp Leu Asp Pro Ser 
355 360 365 

AGC ATC CCC GAC AAG GAA CAG GCG ATC AGC GCC CTG CCG GAC TAC GCC 1152 
Ser He Pro Asp Lys Glu Gin Ala He ser Ala Leu Pro Asp Tyr Ala 
370 375 380 

AGC CAG CCC GGC AAA CCG CCG CGC GAG GAC CTG AAG TAA 1191 
Ser Gin Pro Gly Lys Pro Pro Arg Glu Asp Leu Lys 
385 390 395 



(2) INFORMATION FOR SEQ ID N0:4: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 396 amino acids 

(B) TYPE i amino acid 

(C) ST&ANDSDNESS : single 
<D) TOPOLOGY: linear 

(ii) KOLBCOLB TYPE: protein 
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(v) FRAGMENT TYPE: internal 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO ;4; 



Met 


Glu 


His 


Trp 


Ser 


Tyr 


Trp 


Leu Arg 


Pro 


Gly 


Glu Ala Gly Gly Gly 


X 








5 








10 




15 


Gly 


Ser 


Gly 


Gly 


Gly 


Gly 


Ser 


Gly Gin 


Ala 


Phe 


Val Asn Ala His Met 








20 








25 






30 


Ala 


Glu 


Glu 


Gly 


Gly 


Ser 


Leu 


Ala Ala 


Leu 


Thr 


Ala His Gin Ala Cys 






35 










40 






45 


His 


Leu 


Pro 


Leu 


Glu 


Thr 


Phe 


Thr Arg 


His 


Arg 


Gin Pro Arg Gly Trp 




50 










55 








60 


Glu 


Gin 


Leu 


Glu 


Gin 


Cys 


Gly 


Tyr Pro 


val 


Gin 


Arg Leu Val Ala Leu 


65 










70 








75 


B0 


Tyr 


Leu 


Ala 


Ala 


Arg 


Leu 


Ser 


Trp Asn 


Gin 


Val 


Asp Gin Val He Arg 










85 








90 




95 


Asn 


Ala 


Leu 


Ala 


Ser 


Pro 


Gly 


Ser Gly 


Gly 


Asp 


Leu Gly Glu Ala He 








100 








105 






110 


Arg 


Glu 


Gin 


Fro 


Glu 


Gin 


Ala 


Arg Leu 


Ala 


Leu 


Thr Leu Ala Ala Ala 






115 










120 






125 


Glu 


Ser 


Glu 


Arg 


Phe 


Val 


Arg 


Gin Gly 


Thr 


Gly 


Asn Asp Glu Ala Gly 




130 










135 








140 


Ala 


Ala 


Asn 


Ala 


ASP 


Val 


Val 


Ser Leu 


Thr 


Cys 


Pro Val Ala Ala Gly 


145 










150 








155 


160 


Glu 


Cye 


Ala 


Gly 


Pro 


Ala 


Asp 


Ser Gly 


Asp 


Ala 


Leu Leu Glu Arg Asn 










165 








170 




175 


Tyr 


Pro 


Thr 


Gly 


Ala 


Glu 


Phe 


Leu Gly 


ASP 


Gly 


Gly Asp Val ser Phe 








180 








185 






190 


Sex 


Thr 


Arg 


Gly 


Thr 


Gin 


Asn 


Trp Thr 


val 


Glu 


Arg Leu Leu Gin Ala 






195 










200 






205 


Hi* 


Arg 


Gin 


Leu 


Glu 


Glu 


Arg 


Gly Tyr 


Val 


Pho 


Val Gly Tyr His Gly 




210 










215 








220 


Thr 


Phe 


Leu 


Glu 


Ala 


Ala 


Gin 


Ser He 


val 


Phe 


Gly Gly Val Arg Ala 


225 










230 








235 


240 


Arg 


Ser 


Gin 


ASP 


Leu 


Asp 


Ala 


He Trp 


Arg 


Gly 


Phe Tyr He Ala Gly 










245 








250 




255 


Asp 


Pro 


Ala 


Leu 


Ala 


Tyr 


Gly 


Tyr Ala 


Gin 


Asp 


Gin Glu Pro Asp Ala 




• 




260 








265 






270 


Arg 


Gly 


Arg 


He 


Arg 


Asn 


Gly 


Ala Leu 


Leu 


Arg 


Val Tyr Val Pro Arg 






275 










280 






285 


Ser 


Ser 


Leu 


Pro 


Gly 


Phe 


Tyr 


Arg Thr 


Ser 


Leu 


Thr Leu Ala Ala Pro 




250 










295 








300 


Glu 


Ala 


Ala 


Gly 


Glu 


Val 


Glu 


Arg Leu 


He 


Gly 


His Pro Leu Pro Leu 


305 










310 








315 


320 


Arg 


Leu 


Asp 


Ala 


He 


Thr 


Gly 


Pro Glu 


Glu 


GlU 


Gly Gly Arg Leu Glu 










325 








330 




335 


Thr 


He 


Leu 


Gly 


Trp 


Pro 


Leu 


Ala Glu 


Arg 


Thr 


Val Val He Pro Ser 








340 








345 




350 


Ala 


He 


Pro 


Thr 


ASp 


Pro 


Arg 


Asn Val 


Gly 


Gly 


Asp Leu Asp Pro Ser 






355 










360 






365 


Ser 


lie 


Pro 


Asp 


Lys 


Glu 


Gin 


Ala He 


ser 


Ala 


Leu Pro Asp Tyr Ala 




370 










375 








380 


Ser 


Gin 


Pro 


Gly 


Lys 


Pro 


Pro 


Arg Glu 


Asp 


Leu 


Lys 


385 










390 








395 





(2) INFORMATION FOR SEQ ID NO: 5 r 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH t 5 amino acids 

(B) TYPE i amino acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION! SEQ ID NO: 5: 

Gly Gly Gly Gly Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPEt amino acid 

(C) STRANDBDNESS: Single 

(D) topology: linear 



(xi) SEQUENCS DESCRIPTION: SEQ ID NO: 6: 

Glu Gly Lys Ser Ser Gly Ser Gly Ser Glu Ser Lys Val Asp 

I 5 10 

(2) INFORMATION FOR SEQ ID N0:7: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 amino acids 

(B) TYPE; amino aoid 

(C) STRANDEDN8SS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 7: 

Lys Glu Ser Gly Ser val Ser Ser Glu Gin Leu Ala Gin Phe Arg Ser 

15 10 15 

Leu Asp 
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